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ABSTRACT 

DNA methyltransferases catalyse the transfer of 
a methyl group from the ubiquitous cofactor 
S-adenosyl-L-methionine (AdoMet) onto specific 
target sites on DNA and play important roles in or- 
ganisms from bacteria to humans. AdoMet analogs 
with extended propargylic side chains have been 
chemically produced for methyltransferase-directed 
transfer of activated groups (mTAG) onto DNA, 
although the efficiency of reactions with synthetic 
analogs remained low. We performed steric engin- 
eering of the cofactor pocket in a model DNA 
cytosine-5 methyltransferase (C5-MTase), M.Hhal, 
by systematic replacement of three non-essential 
positions, located in two conserved sequence 
motifs and in a variable region, with smaller 
residues. We found that double and triple replace- 
ments lead to a substantial improvement of the 
transalkylation activity, which manifests itself in a 
mild increase of cofactor binding affinity and a 
larger increase of the rate of alkyl transfer. These 
effects are accompanied with reduction of both 
the stability of the product DNA-M.Hhal-AdoHcy 
complex and the rate of methylation, permitting 
competitive mTAG labeling in the presence of 
AdoMet. Analogous replacements of two conserved 
residues in M.Hpall and M2.Eco31l also resulted in 
improved transalkylation activity attesting a general 
applicability of the homology-guided engineering to 
the C5-MTase family and expanding the repertoire 
of sequence-specific tools for covalent in vitro and 
ex vivo labeling of DNA. 



INTRODUCTION 

The most abundant covalent modification of DNA is 
cytosine methylation which serves to expand the informa- 
tion content of the genome in organisms from bacteria to 
humans. In higher vertebrates, it is part of an intricate 
epigenetic regulatory network (1,2). DNA methylation 
patterns are brought about by DNA methyltransferases 
(MTases) — a class of enzymes that transfer a methyl 
group from the ubiquitous cofactor S-adenosyl-L- 
methionine (AdoMet) onto predefined target sites on 
DNA (Figure 1). Three classes of DNA MTases are 
known which modify the exocyclic amino group of 
adenine, the exocyclic amino group of cytosine or the 
carbon at the 5 position of cytosine (C5-MTases), respect- 
ively. DNA C5-MTases are found both in prokaryotes 
and eukaryotes and share a common mechanism of cataly- 
sis which involves flipping of the target cytosine from the 
DNA helix to the active site followed by its covalent acti- 
vation (3). The enzymatic catalysis involves transient for- 
mation of a Michael adduct between the catalytic cysteine 
and C6 of the cytosine, permitting a direct S N 2 transfer of 
the methyl group onto C5. These enzymes also share a set 
of conserved sequence motifs (I-X) and a large intervening 
variable region, which fold to make the catalytic and 
target recognition domains, respectively (4). The bacterial 
Hhal C5-MTase (M.Hhal), which recognizes the sequence 
GCGC and methylates the inner cytosine (underlined), 
serves as a structural and mechanistic paradigm for this 
class of enzymes (5). 

Most, if not all, bacterial and archaeal DNA MTases 
exhibit clearly defined sequence and base specificity. To 
expand the practical utility of this highly specific enzym- 
atic reaction, two major classes of synthetic AdoMet 
analogs have been developed (6). One such design is 
based on exploiting highly reactive aziridine (7) or 
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Figure 1. Alkylation reactions catalysed by the AdoMet-dependent methyltransferases. (a) General reaction scheme for enzymatic transfer of methyl 
or extended groups from natural cofactor AdoMet (1) or its analogs (2-4) onto DNA. (b) Catalytic mechanism of C5-alkylation of cytosine catalysed 
by DNA cytosine-C5 methyltransferases. Residue numbers correspond to those of M.Hhal. 



N-mustard (8) chemistries, which lead to coupling of a 
whole cofactor molecule to the target DNA (named 
Sequence-specific Methyltransferase-Induced Labeling, 
SMILing). Subsequently, cofactors with activated 
sulfonium-bound side chains have been produced, which 
permit targeted transfer of these linear side chains alone 
(named methyltransferase-directed Transfer of Activated 
Groups, mTAG) (9,10). Both chemistries have been used 
to demonstrate the potential utility of this approach for a 
number of DNA manipulations such as nanoparticle 
design (11), in-cell DNA tracking (12) or optical genome 
mapping (13,14). In the mTAG series, cofactors with 
extended propargylic side chains were chemically 
synthesized (Figure 1) and examined in enzymatic reac- 
tions with all three types of DNA methyltransferases 
(9,10). However, the efficiency of transalkylations 
observed with the wild-type C5-MTases proved insuffi- 
ciently high for routine applications. 

In this work, we performed steric engineering of the 
cofactor pocket in the Hhal DNA cytosine-5 methy- 
ltransferase by systematic replacement of three non- 
essential positions with shorter residues in conserved 
sequence motifs IV and X and in the variable region. 
We found that double and triple replacements confer sub- 
stantial improvements of the transalkylation activity and a 
reduction of the methyltransferase activity in M.Hhal 
and, to a smaller degree, in related M2.Eco31I and 
M.Hpall C5-MTases recognizing the CCGG and 
GGTCTC target sites, respectively. Binding and kinetic 
studies of M.Hhal mutants showed that these replace- 
ments substantially enhance the rate of alkyl transfer 
and reduce the enzyme affinity toward the natural 
cofactor AdoMet and its product AdoHcy. Importantly, 



the engineered MTases can efficiently utilize such synthetic 
analogs in the presence of AdoMet, and the modified 
DNA is not degraded by methylation-dependent restric- 
tion systems in bacteria. These findings open new ways for 
targeted covalent deposition of reporter groups onto 
DNA for a variety of ex vivo and in vivo applications. 

MATERIALS AND METHODS 

Recombinant DNA WT and mutant MTases M.Hhal 
were produced as previously described (15). 
Recombinant DNA MTases M2.Eco31I and M.Hpall 
were expressed as HisTaged proteins and purified accord- 
ing manufacturer recommendations (Supplementary 
Data). HPLC-purified oligodeoxyribonucleotides were 
obtained from MWG (Germany). McrBC endonuclease 
was purchased from New England Biolabs. 

Synthesis of AdoMet analogs 

Chemical synthesis and purification of AdoMet analogs 
were performed as previously described by direct 
chemoselective ^-alkylation of 5-adenosyl-L-homocy- 
steine (9). Cofactor 2 was obtained as a diasteromeric 
mixture of R,S- and S.S-isomers starting from 
butyn-2-ol-l (16). Cofactor 3 was synthesized starting 
from pentyn-2-ol-l and purified to an over 85% chiral 
purity of the enzymatically active S.S-isomer (17). For 
analog 4, the alkylation was performed with 7V-BOC- 
protected 6-(4-aminobutanamido)hex-2-yn-l-yl 4-nitro- 
benzenesulfonate, which was obtained in four steps from 
5-chloropentyne-l following previously described proced- 
ures (10), and the S.S-diastereomer was isolated by 
reversed-phase HPLC. 
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DNA protection assays 

DNA protection assays were performed essentially as pre- 
viously described (9). Briefly, 2-fold serial dilutions of 
DNA MTases were incubated with phage lambda 
(M.Hhal and M.Hpall, 0.6-1.2 ug per reaction) or 
pUC19 (M2.Eco31I, 1.0 ug) DNA in a corresponding 
reaction buffer containing 300 uM cofactors at 37°C for 
1^4 h. MTase reaction buffers were as follows: M.Hhal: 
50 mM Tris-HCl, 15mM NaCl, 0.5 mM EDTA, 2mM 
2-mercaptoethanol, 0.2mg/ml BSA, pH 7.4; M2.Eco31I: 
20 mM MOPS, 20 mM Tris-HCl; 20 mM CAPS, 15mM 
NaCl, 0.5 mM EDTA, 0.2 mM 2-mercaptoethanol, 
0.2mg/ml BSA, pH 8.0; M.Hpall: 20 mM MES, 20 mM 
HEPES, 30 mM NaCl, 0.5 mM EDTA, 0.2 mM 2-mer- 
captoethanol, 0.2mg/ml BSA, pH 7.0. After incubation 
the reactions were stopped by heating to 80° C for 10 
min and the modified DNA was challenged with 10-12 u 
of corresponding restriction endonuclease for 1 h at 37°C. 
M.Hhal modification was assessed with R.Hin6I, 
M2.Eco31I— with R.Eco31I and R.PvuII and 
M.Hpall — with R.Hpall. Samples were analysed by 
agarose gel (1-1.5%) electrophoresis. Full protection of 
substrate DNA from endonuclease cleavage in reaction 
in which the MTase is present in an iV-fold dilution 
relative to its target sites indicates that the enzyme 
carried out at least 7Y turnovers; the turnover rate was 
calculated by dividing the number of turnovers by 
reaction time t (k obs > N/t) 

Single-turnover kinetics 

Reactions were performed using 596 bp DNA fragment 
with a single M.Hhal target site as a substrate. The 
reaction components in syringes A (200-1000 nM 
M.Hhal plus 100 or 200 nM DNA) and B (50 uM 
cofactor) were mixed rapidly in M.Hhal reaction buffer 
at 37°C, and after a specified period the reaction was 
quenched with 6M guanidinium chloride in a Rapid- 
Quench-Flow instrument RQF-3 (KinTek). Modified 
DNA was precipitated with propanol-2, dissolved in 
30 ul of water and treated with 10 units of R.Hin6I. 
DNA fragments were analysed by 1.5% agarose gel elec- 
trophoresis and visualized by scanning with 473 nm laser 
after staining with ethidium bromide (EtBr). Bands inten- 
sity was quantified using Multi gauge v3.0 software. 
Normalized progress curves were fitted to single exponen- 
tial equations using GraFit 5 (18). 

Analysis of cofactor binding 

0.2 uM M.Hhal was preincubated with 0.3 uM duplex 
oligonucleotide I (Supplementary Table SI) and then 
titrated by incremental addition of AdoHcy, AdoMet or 
its analogs in M.Hhal reaction buffer with no BSA. 
Tryptophan fluorescence quenching upon cofactor 
binding (19) was measured at 25° C on PerkinElmer 
LS50B spectrofluorimeter at an excitation wavelength of 
290 nm (slit width 5nm) and emission wavelength of 
350 nm (slit width 7.5 nm). Titration data (Supplementary 
Figure S3) were analysed with the equilibrium solver 
routine of GraFit 5 (18). 



Chromatographic analysis of DNA composition 

Analyses were performed essentially as described previ- 
ously (9). M.Hhal (12.5 uM), oligonucleotide duplex 
I (Supplementary Table SI) and AdoMet or its analogs 
(300 uM) in M.Hhal reaction buffer were incubated at 
37°C for 4h. Samples were then digested with Nuclease 
PI (1.5 u) and calf intestine alkaline phosphatase (30 u) at 
42° C for 4h. Digested DNA samples were loaded onto a 
reversed-phase HPLC column (Discovery HS CI 8, 3um, 
75 x 2.1mm, Supelco) and eluted for 3 min with 20 mM 
ammonium formate (pH 3.5), then by a linear gradient of 
methanol to 20% in 15 min, followed by a linear gradient 
from 20% to 80% methanol in 2 min and 80% methanol 
for 5 min at a flow of 0.3 ml/min and at 30°C. Post-column 
equal co-flow of 96% methanol, 4% formic acid and 
1 mM sodium formate was used for the MS detection of 
modified nucleosides and its derivatives in the 50-500 m/z 
range. 

Transformation of Escherichia coli with alkylated DNA 

2-3 ug of pUC19 plasmid DNA (reaction volume 20-30 ul, 
1 uM Hhal sites) was modified with 300 uM AdoMet, co- 
factors 2 or 4 in the presence of 0.5 uM M.Hhal (variant 
Q82A/Y254S/N304A). Modified DNA was extracted 
twice with an equal volume of Roti®-Phenol/C/I mix, 3 
times with equal volume of chloroform, precipitated with 
propanol-2 and dissolved to a final concentration of 
0.4 ug/ul in water. 

Aliquots (0.8 rig) of modified pUC19 were digested with 
R.Hin6I or McrBC according to manufacturer recommen- 
dations. Samples were analysed by 1.5% agarose gel elec- 
trophoresis and EtBr staining. For the transformation 
experiments, equal aliquots of modified pUC19 DNA 
(1 ng/ul) were used to transform competent E. coli 
JM109 mcrB + and mcrBT cells prepared according to 
methodology described in (20). The Mcr restriction rate 
in vivo is determined as the ratio of ampicillin resistant 
colonies obtained after growing these strains for 16 h at 
37°C on the LB agar medium supplemented with ampicil- 
lin (O.lmg/ml). 

M2.Eco31I-directed labeling of lambda DNA 

Reaction solution containing 72 nM M2.Eco31I (variant 
N127A/Q233A) and 1.8 nM phage lambda DNA in 
reaction buffer (50 mM Tris-HCl pH 7.4, 2.5 mM 
EDTA, 0.2mg/ml BSA, 20 mM NaCl, 0.4 mM 2-mer- 
captoethanol) with 40 uM cofactor 4 was incubated at 
37°C for 1 h. Afterwards samples were mixed with an 
equal amount of water and extracted with Roti®-Phenol 
(pH 8.0), twice with Roti®-Phenol/C/I and three times 
with chloroform and DNA was precipitated with 
propanol-2. Pellets were washed with ice cold 75% 
ethanol and dried. Amino-modified DNA was dissolved 
in water to a final concentration of 0.2 ug/ul and treated 
with fluorescein-NHS (125 uM) in 0.15 M sodium bicar- 
bonate (pH 8.3) for 1 h at room temperature. 
Approximately 1.5 ug of labeled DNA was fragmented 
with R.Eco91I and analysed by 0.8% agarose gel electro- 
phoresis. Gels were scanned with a Fuji FLA-5100 
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imaging system using a 473 nm laser and an LPB filter set 
for fluorescein emission. 

RESULTS 

Steric engineering of M.Hhal for improved 
transalkylations 

Previously we found that the reactivity of M.Hhal with 
extended AdoMet analogs was increased by replacing 
Gln82 to alanine. The observed rates were still lower as 
compared to those achieved with the natural cofactor (9), 
although in theory the S N 2 alkylation rates with propargyl 
and allyl compounds should be similar or higher than the 
methylation rates (21). Directed evolution of M.Hhal for 
altered sequence specificity produced variants in which 
Tyr254 is replaced with serine, and which also showed 
an enhanced transalkylation activity with certain 
extended cofactor analogs (17). To guide our engineering 
effort, we built a structure-based model of M.Hhal- 
DNA-(cofactor 2) complex (Figure 2A and B). The 
model structure showed that Gln82, Tyr254 and Asn304 
residues are in close proximity and thus might sterically 
interfere with the extended transferable side chains. We 
therefore selected these positions for steric engineering of 
the cofactor pocket. Gln82 and Asn304 were replaced with 
an Ala; due to extensive solvent exposure and close inter- 
actions with bound DNA Tyr254 was replaced with a 
small polar residue (Ser). 

Initially, the three single mutants were analysed 
by comparing their activity with AdoMet and three of 
its extended analogs using a DNA protection assay 



(Figure 2C). This semi-quantitative end point assay 
(Supplementary Figure SI) estimates the number of en- 
zymatic turnovers based on the number of modified 
target sites in substrate DNA (9). The estimated 
turnover rate for the WT enzyme with AdoMet 
(2.1 min -1 ) was in good agreement with published 
^cat = 1-2 min -1 observed on a variety of DNA substrates 
(19), but no activity was observed with the extended 
cofactor analogs. All single-mutant enzymes dem- 
onstrated an increased activity toward the AdoMet 
analog 2. The Gln82 substitution resulted in a 4-fold 
increased enzymatic activity with cofactor 2 and an 
~ 16-fold decreased activity with AdoMet. A barely detect- 
able activity was observed with cofactors 3 and 4 respect- 
ively. In contrast, the Y254S mutation caused little effect 
on the methylation activity, but significantly increased ca- 
talysis of the chain transfer from cofactors 2 and 3. The 
most dramatic activity enhancement came from the 
N304A mutation which showed an 8-fold and 4-fold 
faster transfer of the four-carbon and five-carbon chains 
from analog 2 and 3 as compared to the methyl group. 
However, none of the single-mutant variants showed de- 
tectable reactivity with the most extended cofactor 4. This 
observation suggests that although the shortening of the 
304 residue creates adequate space in the active site for the 
transfer of moderately extended groups, it fails to accom- 
modate the transfer of the larger functionalized group. 

To further optimize M.Hhal for the transalkylation 
reactions, all combinations of double and triple mutants 
involving the above three residues were constructed and 
examined. Surprisingly, none of the double (Q82A/N304A 
or Y254S/N304A) or triple (Q82A/Y254S/N304A) mutant 




Figure 2. Engineering the transalkylation reactions in the Hhal methyltransferase. Models of cofactor 2 bound in the active site of the WT (a) and 
an engineered variant (b) of M.Hhal (Q82A/Y254S/N304A) with DNA. Modeling was based on M.Hhal-DN A- AdoMet X-ray structure (PDB code 
6mht). Red arrows point along the trajectory from attacking the C5 atom of the target cytosine residue to the transferable carbon in the cofactor. 
Pictures are made with Swiss-PdbViewer program (3.7 sp5 version) (22). (c) AdoMet and its analogs activity with WT M.Hhal and its variants. 
Turnover rates were estimated using a DNA protection assay. 
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Figure 3. Effects of steric engineering on cofactor binding and catalysis by M.Hhal. Left, binding affinity (^ D cofactor ) of cofactors 1-5 in the ternary 
pre-catalytic complex M.Hhal-DNA-cofactor was determined for a catalytic (C81S) mutant and a corresponding engineered variant (C81S/Y254S/ 
N304A); right, single-turnover reaction rates (& C hem) of the WT and engineered (Y254S/N304A) M.Hhal with cofactors 1 and 2 as indicated. 



enzymes gave a significant enhancement in transferring the 
alkyne groups from cofactor 2 or 3 compared to 
the N304A variant (Figure 2C). However, the transfer of 
the largest group is accelerated in the Y254S/N304A and 
Q82A/Y254S/N304A variants, which both show a simi- 
larly high activity with the cofactor analog 4. Moreover, 
the Q82A mutants displayed a small, if any, enhancement 
of the transalkylation rate, but led to consistent reduction 
of the methylation rate (Figure 2C). Altogether, 
both double mutants involving N304A and the triple 
mutant accepts the largest cofactor 4 in par with 
AdoMet; the achieved turnover rates of 0.1 3-0.25 min -1 
(Supplementary Table S2) permit complete derivatization 
of DNA in < 1 5 min, which makes the reaction suitable for 
routine laboratory applications. 

The sequence specificity of the engineered M.Hhal 
variants in the presence of cofactor analogs is indirectly 
attested by the DNA protection experiments, in which 
complete protection of Hhal sites in bacteriophage DNA 
was observed. To verify that the protein engineering does 
not alter the specificity of base modification, an oligo- 
nucleotide duplex was modified by the triple mutant 
Q82A/Y254S/N304A and the model cofactor analogs 
and its nucleoside composition was determined using 
reversed-phase HPLC. In addition to the four major nu- 
cleosides, novel compounds with longer retention times 
were observed and identified as corresponding alkylated 
2'-deoxycytidines using in-line ESI-MS analysis 
(Supplementary Figure S2). 

Mechanistic effects of the Y254S/N304A mutations 

To assess whether the cofactor binding or the catalytic 
power is a major factor for the enhanced alkylation 
activity, the best performing and least altered variant 
(Y254S/N304A) was subjected to further biochemical 
studies. First, we determined the binding affinity of the 
cofactors (^ D cotactor ) in the ternary pre-catalytic complex 
M.Hhal-DNA-cofactor. Catalytic turnover of the ternary 
complex during the cofactor binding measurements was 
abolished by replacing the catalytic Cys81 residue with a 
serine (23) in both the wild-type enzyme and in the engin- 
eered variant. The cofactor binding experiments 
(Supplementary Figure S3) exploited fluorescence 
changes of a unique tryptophan (Trp41) (19). 



Surprisingly, we found that binding of the analogs 2 and 
4 to the engineered variant of M.Hhal (C81S/Y254S/ 
N304A) was barely improved (<2-fold) as compared to 
the catalytic C81S mutant (Figure 3, left). On the other 
hand, we found a strong negative effect on AdoMet 
(4-fold) and AdoHcy (60-fold) binding. 

Furthermore, we determined the single-turnover 
reaction rates (k chem ) of the WT and Y254S/N304A 
using rapid-quench flow technique (Supplementary 
Figure S4). Remarkably, under these saturating condi- 
tions ([E]>[DNA], [cofactor] >>^ D cofactor ), the mutant 
enzyme displayed a 24-fold enhanced transfer of the 
butynyl groups from cofactor 2 as compared to the WT 
enzyme, whereas the rate of methyltransfer was decreased 
by a similar margin (27-fold) (Figure 3, right). Altogether, 
our analyses indicate that the described mutations lead to 
changes in the catalytic transfer rather than cofactor 
binding. In addition, a weaker retention of the reaction 
product AdoHcy in the engineered enzyme also contrib- 
utes to the overall catalytic efficiency in vitro since it leads 
to a faster dissociation of product complex MTase- 
modified DNA- AdoHcy (19) and reduced non-productive 
binding of AdoHcy to the binary M.Hhal-DNA complex 
(Figure IB). 

Enzymatic transalkylation of DNA in the 
presence of AdoMet 

Taking together the enhanced catalytic power of the en- 
gineered M.Hhal variants in alkyltransfer reactions (espe- 
cially with cofactors 2 and 3) and their reduced affinity 
toward the natural cofactor one might expect a shifted or 
reversed cofactor preference during reaction. Efficient 
aminoalkylation reactions in the presence of endogenous 
AdoMet would potentially permit in situ or even in vivo 
derivatization and subsequent labeling of DNA. Our 
DNA protection experiments indicated that the turnover 
rates of the triple (Q82A/Y254S/N304A) mutant were 
similar with AdoMet or the Ado- 11 -amine cofactor 4 
(Figure 2C), however the cofactor preference of the 
enzyme in direct competition between the two cofactors 
remained unclear. As a proof of principle, enzymatic 
modification of pUC19 plasmid DNA in the presence of 
varied molar ratios of AdoMet to its analog was per- 
formed. Methylation and alkylation of DNA was 
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Figure 4. mTAG aminoalkylation and labeling of DNA in the presence 
of AdoMet. pUC19 plasmid was modified with M.Hhal (Q82A/Y254S/ 
N304A mutant) and a mixture of AdoMet and cofactor 4 supplied in 
different molar fractions as indicated (total cofactor concentration was 
50 uM). Modified DNA was labeled with Cy5-NHS, fragmented with 
R.BseSI and separated by agarose gel electrophoresis. Bulk DNA was 
visualized after staining with EtBr. Visualization was performed using 
473 nm (EtBr) and 635 nm (Cy5) lasers. C is control sample incubated 
without cofactor; M, DNA size marker GeneRuler™ 100 bp Plus 
DNA Ladder. 



(a) 



16 



■o 




o 


12- 


ion, 




o 






8- 


resl 




BC 


4- 


o 
2 





3_ 



No cofactor AdoMet (1) Cofactor 2 Cofactor 4 



(b) 





No cofactor 


AdoMet (1) 


Cofactor 2 


Cofactor 4 




<5 o 




<5 o 


3 o 




.S CD 




.S m 






=c b 


b 


=c o 


=c b 


/I 




- en 2 


- QL S 


- QL 5 



probed using combination of the R.Hin6I and R.BspLI 
endonucleases (Supplementary Figure S5). Moreover, in- 
corporation of the extended chain with a reactive amino 
group was assessed by labeling the modified DNA with 
Cy5-NHS (Figure 4). Visualization of Cy5-labeled DNA 
showed that Q82A/Y254S/N304A variant is capable of 
efficiently alkylating DNA in presence of physiologically 
relevant concentrations of AdoMet (8.5-300 uM) (24). 
The label is incorporated even in presence of 5-fold 
excess of AdoMet (Figure 4). This result opens the possi- 
bility for sequence-specific labeling of DNA in vivo or in 
the cell in presence of endogenous AdoMet. 

Restriction of C5-alkylated DNA in bacterial cells 

The presence of natural cytosine modifications in DNA 
such as 5mC, 4mC or 5hmC is thoroughly surveyed by 
dedicated DNA restriction systems in bacteria. For 
example, the McrBC system recognizes the above 
modified cytosine residues in certain sequence contexts 
and destroys modified DNA by endonucleolytic 
cleavage, whereby restricting entry of many types of 
methylated and hydroxymethylated plasmids into E. coli 
cells (25). For this reason, plasmid DNA premethylated by 
M.Hhal (Figure 5) and many other DNA MTases is 
strongly restricted by the McrBC system both in vitro 
and in vivo. However, the sensitivity of the system to 
DNA artificially modified with larger groups has not 
been studied before. Experiments presented in Figure 5, 
bottom panel, show that the McrBC endonuclease fails to 
attack the DNA modified using AdoMet analogs 2 
or 4. The McrBC system renders a 14-fold restriction of 
the M.Hhal-methylated plasmid, however equal trans- 
formation efficiency was observed in experiments 
involving alkylated plasmid DNAs in either mcrB~ 



Figure 5. Sensitivity of mTAG-alkylated DNA to the McrBC endo- 
nuclease in vitro and in vivo, (a) The McrBC restriction rate in vivo 
is determined as the ratio of ampicillin resistant colonies obtained in 
mcrB" versus mcrB + strains after transformation with pUC19 plasmid 
DNA modified in the presence of cofactor AdoMet or its analogs 
2 and 4. (b) mTAG-modified pUC19 DNA as above was treated with 
the McrBC endonuclease or R.Hin6I (control for complete modifica- 
tion) in vitro and analysed by agarose gel electrophoresis. no endo- 
nuclease added; M, DNA size marker GeneRuler™ 100 bp Plus DNA 
Ladder. 

or mcrB + E. coli strain (Figure 5, top panel). These obser- 
vations clearly indicate that the McrBC restriction system 
is inert to mTAG-alkylated DNA in vitro and in vivo. This 
observation is consistent with recent McrBC-DNA 
co-crystal structures, which show tight binding of the 
flipped out 5-hydroxymethylcytosine (26), however 
residues carrying larger groups seem unlikely to be 
accommodated in the binding pocket. 

Homology-based engineering of M.Hpall and M2.Eco31I 

The structural conservation of C5-MTases and successful 
engineering of M.Hhal suggested that other orthologs can 
be similarly engineered based on sequence alignment even 
in the absence of crystal structures (Figure 6). We first 
turned to the Hpall C5-MTase (M.Hpall), which recog- 
nizes CCGG targets in DNA (27) and which thus can 
potentially be useful to study mammalian DNA modifica- 
tion at CpG sites. As expected, the WT enzyme showed 
low or no activity with AdoMet analogs 2 and 4, respect- 
ively. Replacement of the Q104 and N335 residues 
(correspond to Q82 and N304 in M.Hhal) to alanines 
resulted in an enzyme with a significantly enhanced 
activity toward these analogs (Figure 6, Supplementary 
Figure SIB and Supplementary Table S3). 
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Figure 6. Homology-based engineering of C5-MTases in conserved sequence motifs IV and X. (a) Amino acid sequence alignment of regions 
corresponding to IV and X conserved motifs of prokaryotic and eukaryotic cytosine-C5 MTases. Arrows indicate Gln82 and Asn304 positions of 
M.Hhal in IV and X conserved motifs. Four degrees of conservation in descending order: black background with white text, dark gray background 
with white text, gray background with black text and white background with black text. Amino acid conservation of the Gln82 position (Hhal) in 
DNA C5-MTases is depicted, based on an alignment of ~530 DNA C5-MTase sequences from the Pfam database (http:www.sanger.ac.uk/Software/ 
Pfam/) (b) Permutation of conserved motifs in the M.Hhal, M.Hpall and M2.Eco31I DNA methyltransferases. (c) Enzymatic transalkylation of 
DNA by engineered variants of M.Hpall and M2.Eco31I. Turnover rates (shown in logarithmic scale) were estimated using the DNA protection 
assay. 



Another studied example was the bacterial C5-MTase 
Eco31IC (M2.Eco31I) which recognizes an asymmetric 
hexanucleotide sequence GGTCTC in DNA and 
attaches methyl group to the inner cytosine residue on 
one strand of the target site (28). Therefore this MTase 
could be used for strand- and sequence-specific labeling of 
larger DNA molecules. Initial testing of the wild-type 
enzyme showed that it possesses low activity toward 
cofactor 2 and negligible activity with cofactor 4. The 
single mutation N127A (corresponds to N304A in 
M.Hhal) resulted in a significant increase of the 
alkyltransferase activity with cofactors 2 and 4. However 
the N127A/Q233A variant was capable of catalysing 
extended group transfer from AdoMet analog 4 with a 
50-fold higher efficiency as compared to the WT MTase 
(Figure 6, Supplementary Figure SIC and Supplementary 
Table S3). Bacteriophage lambda contains only two 
Eco31I target sites, and is thus well suited to demonstrate 
a highly specific fluorescent labeling of a large natural 
DNA molecule (Supplementary Figure S6). 

These examples clearly show that a high degree of struc- 
tural conservation of the catalytic domain of C5-MTases 
permits successful engineering of the methyltransferase 
reaction for sequence-specific labeling of DNA. 

DISCUSSION 

Structure-guided engineering of the cofactor pocket in 
M.Hhal produced four variants that show faster transfer 



of extended linear groups containing four or five carbon 
units (cofactor analogs 2 and 3) as compared to the 
natural AdoMet cofactor (Figure 2C). The transfer of 
the butyne group from cofactor 2 can occur several-fold 
faster (lOmin" 1 ) than the catalytic rate of WT M.Hhal 
with AdoMet (1.3 min -1 ) (19) and approaches the rate of 
methyltransfer observed with WT M.Hhal under 
single-turnover conditions (15-30min -1 ) (Figure 3, and 
(19)). This indicates that the Hhal methyltransferase has 
been actually converted into a sequence-specific DNA 
alkyltransferase and that a full catalytic power has been 
attained in the engineered C5-transferase reaction. 
Further extension of the transferable side chain to 12 
linear units (cofactor 4) leads to somewhat lower but 
still useful turnover rates of 0.3 min -1 . 

In contrast to our expectations, a major contributor to 
the efficiency of the transalkylation reactions in the engin- 
eered MTase turned out to be enhanced catalytic transfer 
rather than improved cofactor binding. In other words, 
the extended cofactors can bind to the WT and to the 
mutant variants of M.Hhal with similar affinity, 
however, the structure of the ternary complex in the en- 
gineered proteins is more favorable for catalysis. 
Inspection of the available crystal structures indicates 
that a wider solvent channel in the cofactor pocket 
permits a higher conformational flexibility of the 
extended cofactor, and rotation of the cofactor side 
chain into the newly created space may position the trans- 
ferable carbon atom into a more favorable pre-catalytic 
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conformation in which one of the two hydrogen atoms 
assumes a less obstructive position for an in-line attack 
of the C5 nucleophile (Supplementary Figure S7). 

The improvement of the catalytic rate with the extended 
cofactors was accompanied with a substantial reduction of 
AdoHcy (60-fold) and AdoMet (4-fold) binding and a 
strong decrease of the rate of methylation (30-fold). 
Among the three mutated residues, Gln82 proved to be 
the most important for the methyl group transfer since all 
of the M.Hhal variants containing the Q82A mutation 
displayed a significant reduction of the methylation rate 
but no change of the transalkylation rates (Figure 2C). 
This residue is located in the mobile catalytic loop and 
thus the Q82A mutation reduces the number of intramo- 
lecular contacts to the core of the protein (N304A and 
Y254S would be expected to have similar but smaller 
effects), which in turn may destabilize the closed conform- 
ation of the mobile catalytic loop in the enzyme leading to 
a reduction of the apparent methylation rate. The reason 
for the strongly impaired binding of AdoHcy as compared 
to AdoMet and the other extended cofactors is not fully 
clear. One might expect that, in general, cofactors with 
smaller surface areas would be less efficient in bridging 
the expanded space between the catalytic loop and the 
rest of the protein, which is critical for to maintaining 
the closed catalytic conformer. On the other hand, a 
higher conformational flexibility of the thioether moiety 
in AdoHcy (as compared to tri-substituted sulfonium 
centers) may be another destabilizing factor in the 
context of fewer restraining contacts from adjacent 
protein residues. As discussed above, the weaker retention 
of AdoHcy contributes to the overall catalytic efficiency of 
the engineered enzyme due a faster dissociation of product 
complex AdoHcy-MTase-modified DNA and reduced in- 
hibition potency of AdoHcy (Figure IB). 

Since two of the three mutations lie within highly 
conserved sequence motifs (IV and X), corresponding 
mutations in other DNA C5-MTases were expected to 
improve the acceptance of the extended AdoMet 
analogs. Although the magnitudes of the observed 
effects estimated in DNA protection experiments 
appeared quite distinct, both M.Hpall and M2.Eco31I 
showed improved reactivity toward AdoMet analogs at- 
testing a general utility of homology-based engineering of 
the cofactor pocket in C5-MTases. For M2.Eco31I, which 
is an example of sequence permutation in C5-MTases 
(Figure 6B), the cofactor preference of the double 
mutant was very similar to that of M.Hhal (cofactor 
2 > cofactors 1, 4), whereas the engineered M.Hpall still 
showed a profile characteristic of a WT MTase (cofactor 
1 > cofactor 2 > cofactor 4). Nonetheless, taking into 
account a common structural organization of the 
AdoMet binding pocket (3,6,29) and recent examples 
from protein MTases (30,31), successful engineering of 
other classes of DNA and RNA MTases can be well 
anticipated. 

The MTase-directed derivatizations of DNA occur in a 
highly specific manner, with full retention of sequence-, 
base- and atomic specificity whereby a linear moiety is 
attached on a DNA molecule. Such labeling offers an im- 
portant advantage over endonuclease-polymerase-based 



techniques (32) since it does not rely on replacing 
natural nucleotides with modified ones in adjacent 
regions and thus preserves all pre-existing modifications 
in the DNA, which could be further subjected to investi- 
gation. Examples of practical application of 
sequence-specific mTAG labeling for single-molecule 
genotyping (13) and epigenome profiling (Kriukiene 
et al., submitted elsewhere) have recently been provided. 
Moreover, we demonstrate that (i) mTAG labeling can be 
performed in the presence of competing AdoMet 
(Figure 4) and that (ii) mTAG-modified DNA is 
immune to modification-dependent restriction systems 
such as McrBC of E. coli (Figure 5), which open ways 
to studying artificially modified DNA in bacterial cells. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online: 
Supplementary Tables 1-3, Supplementary Figures 1-7, 
Supplementary Methods and Supplementary Reference 
[33]. 
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